VTC: Improving Video-Text Retrieval with User Comments
نویسندگان
چکیده
AbstractMulti-modal retrieval is an important problem for many applications, such as recommendation and search. Current benchmarks even datasets are often manually constructed consist of mostly clean samples where all modalities well-correlated with the content. Thus, current video-text literature largely focuses on video titles or audio transcripts, while ignoring user comments, since users tend to discuss topics only vaguely related video. Despite ubiquity comments online, there currently no multi-modal representation learning that includes comments. In this paper, we a) introduce a new dataset videos, comments; b) present attention-based mechanism allows model learn from sometimes irrelevant data c) show by using our method able better, more contextualised, representations image, representations. Project page: https://unitaryai.github.io/vtc-paper.
منابع مشابه
Examining User Interactions with Video Retrieval Systems
The Informedia group at Carnegie Mellon University has since 1994 been developing and evaluating surrogates, summary interfaces, and visualizations for accessing digital video collections containing thousands of documents, millions of shots, and terabytes of data. This paper reports on TRECVID 2005 and 2006 interactive search tasks conducted with the Informedia system by users having no knowled...
متن کاملImproving Multimedia Retrieval with a Video OCR
We present a set of experiments with a video OCR system (VOCR) tailored for video information retrieval and establish its importance in multimedia search in general and for some specific queries in particular. The system, inspired by an existing work on text detection and recognition in images, has been developed using techniques involving detailed analysis of video frames producing candidate t...
متن کاملImproving Text Summarization Using Noun Retrieval Techniques
Text Summarization and categorization have always been two of the most demanding information retrieval tasks. Deploying a generalized, multifunctional mechanism that produces good results for both of the aforementioned tasks seems to be a panacea for most of the text-based, information retrieval needs. In this paper, we present the keyword extraction techniques, exploring the effects that part ...
متن کاملImproving the Automatic Retrieval of Text Documents
This paper reports on a statistical stemming algorithm based on link analysis. Considering that a word is formed by a prefix (stem) and a suffix, the key idea is that the interlinked prefixes and suffixes form a community of sub-strings. Thus, discovering these communities means searching for the best word splits that give the best word stems. The algorithm has been used in our participation in...
متن کاملImproving Cross-Language Text Retrieval with Human Interactions
Can we expect people to be able to get information from texts in languages they cannot read? In this paper we review two relevant lines of research bearing on this question and will show how our results are being used in the design of a new Web interface for cross-language text retrieval. One line of research, “Interactive IR”, is concerned with the user interface issues for information retriev...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19833-5_36